A Survey of Binary Code Similarity
نویسندگان
چکیده
Binary code similarityapproaches compare two or more pieces of binary to identify their similarities and differences. The ability enables many real-world applications on scenarios where source may not be available such as patch analysis, bug search, malware detection analysis. Over the past 22 years numerous similarity approaches have been proposed, but research area has yet systematically analyzed. This article presents first survey similarity. It analyzes 70 approaches, which are systematized four aspects: (1) they enable, (2) approach characteristics, (3) how implemented, (4) benchmarks methodologies used evaluate them. In addition, discusses scope origins area, its evolution over decades, challenges that lie ahead.
منابع مشابه
A Survey of Binary Similarity and Distance Measures
The binary feature vector is one of the most common representations of patterns and measuring similarity and distance measures play a critical role in many problems such as clustering, classification, etc. Ever since Jaccard proposed a similarity measure to classify ecological species in 1901, numerous binary similarity and distance measures have been proposed in various fields. Applying approp...
متن کاملSimilarity measures for binary and numerical data: a survey
Similarity measures aim at quantifying the extent to which objects resemble each other. Many techniques in data mining, data analysis or information retrieval require a similarity measure, and selecting an appropriate measure for a given problem is a difficult task. In this paper, the diverse forms similarity measures can take are examined, as well as their relationships and respective properti...
متن کاملA Survey on Tools for Binary Code Analysis
Different strategies for binary analysis are widely used in systems dealing with software maintenance and system security. Binary code is self-contained; though it is easy to execute, it is not easy to read and understand. Binary analysis tools are useful in software maintenance because the binary of software has all the information necessary to recover the source code. It is also incredibly im...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Computing Surveys
سال: 2021
ISSN: ['0360-0300', '1557-7341']
DOI: https://doi.org/10.1145/3446371